AITopics | accelerated gradient clipping

Collaborating Authors

accelerated gradient clipping

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Stochastic Optimization with Heavy-Tailed Noise via Accelerated Gradient Clipping

Neural Information Processing SystemsDec-24-2025, 10:37:25 GMT

In this paper, we propose a new accelerated stochastic first-order method called clipped-SSTM for smooth convex stochastic optimization with heavy-tailed distributed noise in stochastic gradients and derive the first high-probability complexity bounds for this method closing the gap in the theory of stochastic optimization with heavy-tailed noise. Our method is based on a special variant of accelerated Stochastic Gradient Descent (SGD) and clipping of stochastic gradients. We extend our method to the strongly convex case and prove new complexity bounds that outperform state-of-the-art results in this case. Finally, we extend our proof technique and derive the first non-trivial high-probability complexity bounds for SGD with clipping without light-tails assumption on the noise.

accelerated gradient clipping, heavy-tailed noise, stochastic optimization, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)

Add feedback

Review for NeurIPS paper: Stochastic Optimization with Heavy-Tailed Noise via Accelerated Gradient Clipping

Neural Information Processing SystemsJan-27-2025, 12:51:18 GMT

Weaknesses: * In [71] there are several theoretical guarantees both for convex and non-convex cases. I am wondering why they are not mentioned in Table 2. On the other hand, their analysis also covers the case where the domain doesn't need to be compact. Doesn't this reduce the novelty of this paper? I am willing to increase my grade if this concern is addressed. It would be interesting to see a comparison between the results in this paper and theirs.

accelerated gradient clipping, heavy-tailed noise, stochastic optimization, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.92)

Add feedback

Review for NeurIPS paper: Stochastic Optimization with Heavy-Tailed Noise via Accelerated Gradient Clipping

Neural Information Processing SystemsJan-27-2025, 12:51:11 GMT

After discussion, all reviewers agree that this paper makes a good contribution to the study of clipped sgd.

accelerated gradient clipping, heavy-tailed noise, stochastic optimization, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.40)

Add feedback

Stochastic Optimization with Heavy-Tailed Noise via Accelerated Gradient Clipping

Neural Information Processing SystemsOct-11-2024, 02:10:48 GMT

accelerated gradient clipping, heavy-tailed noise, stochastic optimization, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)

Add feedback